Efficient Skyline Computation on Massive Incomplete Data

نویسندگان

چکیده

Abstract Incomplete skyline query is an important operation to filter out pareto-optimal tuples on incomplete data. It harder than due intransitivity and cyclic dominance. analyzed that the existing algorithms cannot process massive data efficiently. This paper proposes a novel table-scan-based TSI algorithm deal with high efficiency. solves issues of dominance by two separate stages. In stage 1, computes candidates sequential scan table. The dominated others are discarded directly in 1. 2, refines another scan. pruning devised this reduce execution cost TSI. By assistant structures, can skip majority phase 1 without retrieving it actually. extensive experimental results, which conducted synthetic real-life sets, show compute

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Skyline View: Efficient Distributed Subspace Skyline Computation

Skyline queries have gained much attention as alternative query semantics with pros (e.g.low query formulation overhead) and cons (e.g.large control over result size). To overcome the cons, subspace skyline queries have been recently studied, where users iteratively specify relevant feature subspaces on search space. However, existing works mainly focuss on centralized databases. This paper aim...

متن کامل

Efficient Progressive Skyline Computation

In this paper, we focus on the retrieval of a set of interesting answers called the skyline from a database. Given a set of points, the skyline comprises the points that are not dominated by other points. A point dominates another point if it is as good or better in all dimensions and better in at least one dimension. We present two novel algorithms, Bitmap and Index, to compute the skyline of ...

متن کامل

Skyline Computation on Commercial Data

• Our data set contains data on 55208 cars [1]. • To each car, 23 attributes are assigned. – correlated (e.g., cylinders and engine size). – anti-correlated (e.g., mileage and registration date). – nearly independent (e.g., mileage and horsepower). • Outliers countervail correlation effects. • Cardinalities differ greatly, e.g.: – 5988 different values for attribute price. – only 17 different v...

متن کامل

Efficient Skyline Computation in MapReduce

Skyline queries are useful for finding interesting tuples from a large data set according to multiple criteria. The sizes of data sets are constantly increasing and the architecture of back-ends are switching from single-node environments to non-conventional paradigms like MapReduce. Despite the usefulness of skyline queries, existing works on skyline computation in MapReduce do not take full a...

متن کامل

Efficient computation of combinatorial skyline queries

Current skyline evaluation techniques are mainly to find the outstanding tuples from a large dataset. In this paper, we generalize the concept of skyline query and introduce a novel type of query, the combinatorial skyline query, which is to find the outstanding combinations from all combinations of the given tuples. The past skyline query is a special abundant when used in decision making, mar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Science and Engineering

سال: 2022

ISSN: ['2364-1541', '2364-1185']

DOI: https://doi.org/10.1007/s41019-022-00183-7